Paralinguistic elements in speech synthesis
نویسندگان
چکیده
Corpus based text-to-speech systems currently produce very natural synthetic sentences, though limited to a neutral inexpressive speaking style. Paralinguistic elements are some of the expressive features one would most like to introduce. In this paper, we describe a new method for introducing laughter and hesitation in synthetic speech. Thanks to a small dedicated acoustic database, this method can successfully render transitions between speech and paralinguistic elements. We validate it here for French but extension to other languages should be straightforward.
منابع مشابه
Study on Unit-Selection and Statistical Parametric Speech Synthesis Techniques
One of the interesting topics on multimedia domain is concerned with empowering computer in order to speech production. Speech synthesis is granting human abilities to the computer for speech production. Data-based approach and process-based approach are the two main approaches on speech synthesis. Each approach has its varied challenges. Unit-selection speech synthesis and statistical parametr...
متن کاملLinguistic & Paralinguistic Phonetic Variation in Speaker Recognition & Text-to-Speech Synthesis
Phonetic variation, and especially prosodic variation, which is often paralinguistic in nature has gradually attracted more attention among speech researchers and speech scientists as one of the possible solutions to problems with automatic speaker recognition (ASrR) and text-to-speech synthesis (TTS) systems. This paper presents a brief overview of approaches to phonetic variation in ASrR and ...
متن کاملUsing Fuzzy Sets to Model Paralinguistic Content in Speech as a Generic Solution for Current Problems in Speech Recognition and Speech Synthesis
Current problems in speech processing exist due to infinite variations of speech utterances. No two speech utterances are exactly alike, even if they are linguistically the same word. The difference is therefore, due to the paralinguistic content of the speech utterances. This leads to the conceptualization of the paralinguistic content of speech as arising from infinite variation. Infinite var...
متن کاملRobust estimation of multiple-regression HMM parameters for dimension-based expressive dialogue speech synthesis
This paper describes spontaneous dialogue speech synthesis based on multiple-regression hidden semi-Markov model (MRHSMM), which enables users to specify paralinguistic information of synthesized speech with a dimensional representation. Paralinguistic aspects of synthesized speech are controlled by multiple regression models whose explanatory variables are abstract dimensions such as pleasant-...
متن کاملParalinguistic Phonetics in NLP Models & Methods
Natural language processing (NLP) is gradually becoming a more multidisciplinary field, and research in remotely connected aspects of language such as paralinguistic phonetics may benefit from as well as contribute to some areas of NLP. This paper provides a brief overview of paralinguistic phonetics, and some current NLPrelated methods and models used in TTS and ASR systems today. In order to ...
متن کامل